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We performed a comparative dynamical investigation of chaotic test data using the global flow reconstruction method. We 
demonstrate that large-amplitude, long-term variations may have a disturbing effect in the analysis. The Empirical Mode 
Decomposition method (EMD) and the Fourier filtering were tested to remove the additional variations. Test results show 
that the elimination of these variations significantly increased the robustness of the reconstructions. 
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1 Introduction 

Dynamical investigation of chaotic variable stars have al- 
ready been performed on a few red giant or supergiant vari- 
able stars. Most of them belong to the semiregular class 
(Buchler, Kollath & Cadmus 2004), but RV Tauri- (Buchler 
et al. 1996; Kollath et al. 1998) and Mira-type stars (Kiss & 
Szatmary 2002) show chaotic behaviour as well. A compre- 
hensive investigation of these classes of stars may recover 
more candidates. 

Determination of quantitative properties require long 
and quasi-continuous data set of the light variation. Ade- 
quate data is available: the AAVSO and similar databases 
provide a large collection of visual observations spanning 
more than hundred years. Complemented with survey ob- 
servations like ASAS and NSVS data, we can obtain time 
series that are useful for dynamical studies. 

A suitable analyser tool, the global flow reconstruction 
method (Buchler & Kollath 2001) have been used in the 
studies of the chaotic stars mentioned above (with the ex- 
emption of the Mira-type star.) The method is able to detect 
low dimensional chaos. However, a large-amplitude, long- 
term variation which is independent of the chaotic dynam- 
ics may increase the minimum embedding dimension of the 
system, reaching the undetectable level. This kind of varia- 
tion is often visible in observational data, but the origin is 
not clear in many cases. The so-called long secondary pe- 
riods, variations over some 10 times the pulsation period, 
have been observed in several semiregular stars (Woods, 
Olivier & Kawaler 2004). Most proposed mechanisms are 
independent of the pulsation itself, e.g. stellar rotation (Kiss 
et al. 2000), chromospheric activity (Woods et al. 2004), 
giant cell turnover (Stothers 2010), or binarity (Derekas et 
al. 2006). RV Tauri stars may also have orbital companions 
and circumstellar dust disks (Van Winckel et al. 1998). We 
can not exclude the possibility of observational bias either. 
The use of different comparison stars may lead to deviations 
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both in the observed average luminosity and the amplitude 
of the pulsation. The latter is crucial in our analysis, there- 
fore it must be investigated before performing dynamical 
investigations. 

Taking the assumption that long-term variations are not 
connected to the oscillation of the star, we consider re- 
moving these components from the light curves. Neverthe- 
less we believe that detailed tests have to be done before 
performing such conversion of observational data because 
chaotic signals may also have low frequency components 
which, if removed, distort the signal itself. 

Fitting of long-term components is not unambiguous 
since they are often complex in nature (i.e. neither sinu- 
soidal nor smooth trends). Therefore we tested two different 
methods to eliminate them: the Empirical Mode Decompo- 
sition (EMD) method and the Fourier filtering. The basic 
difference between the methods is not only the empirical 
and the analytical approach, but that the EMD process is 
optimized to handle the time series itself while the latter is 
to the Fourier transform of it. 

We applied the dynamical analysis on a test data with 
an added long-term variation, and we repeated it on vari- 
ous cleaned data. In this paper we present the results of this 
comparison. 

2 Data preparation 

In our previous test paper on global flow reconstruction 
(Plachy & Kollath 2010) we presented an analysis of a test 
data generated by coupling two Rossler oscillators. (Equa- 
tions and parameters can be found in that paper). That data 
set turned out to be an excellent imitation of chaotic light 
curves, so we decided to use it in this analysis as well. 

For the long-term variation component we chose a few 
cycles long fraction of the time series of the chaotic cou- 
pled Rossler oscillators. It was amplified and stretched to 
the length of the original data. This ensured the long-term 
variation component to be also chaotic, i.e. not constant in 
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Fig. 1 Composition of TEST1 data. The original and the 
long-term variation data are both generated with coupled 
Rossler oscillators (Plachy & Kollath 2010). TEST1 is the 
sum of the two. Time is in arbitrary units. 
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Fig. 2 Fourier transform and Broomhead-King projec- 
tions of TEST1. Frequency is in arbitrary units. 



frequency and amplitude. TEST1 data set is the simple sum 
of the Rossler and the long-term component (Figure [T). 

TEST2 data was created by removing the extra variation 
from TEST1 with the EMD method. This method was de- 
veloped by Huang et al. (1998) and it is used in analysis of 
nonlinear and nonstationary processes. It is rarely applied 
in the field of astronomy; however, it showed up in recent 
studies of solar activity (e.g. Vecchio et al. 2012; Li et al. 
2012). The decomposition is based on the local character- 
istic time scale of the data and results in a finite number of 
intrinsic mode functions (IMFs). The IMFs are amplitude- 
and frequency-modulated components. Their time scale of 
variation increases at the successive steps of the decompo- 
sition process. This nature of the technique can be used for 
eliminating long-term variations from the data, by creating 
an artificial signal as the sum of the first few IMFs. 

The number of required IMFs is not trivial. In our case 
the sum of the residual IMFs resembled the long-term vari- 
ation the most, when we used the first five IMFs (Figure [3). 
However, the exact shape of the extra variation is not known 
in real data, therefore we investigated a different solution as 
well: TEST3 is the sum of the first three IMFs. 

TEST4 was created using the Fourier filtering tech- 
nique. We applied a sharp tangent hyperbolic filter function 
with a midpoint at 5 • 10~ 4 frequency value. We display the 
low frequency region of Fourier-spectra of TEST2, TEST3, 
TEST4 and of the removed variations in Figure @] They are 
comparable to the original Rossler data and the long-term 
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Fig. 3 The first five intrinsic mode functions (IMFs) of 
TEST1 data. The lowest two panels show the sums of the 
residual IMFs, after separating the first five and three IMFs. 
TEST2 is the sum of the first five IMFs, TEST3 is the sum 
of the first three IMFs. 



variation we added. If we use less IMFs than it is required, 
false peaks may appear in the Fourier-spectrum, as in the 
case of the residual of TEST3. Since the EMD method is 
optimized in the time domain, the neighbouring IMFs over- 
lap each other in the frequency space and they may contain 
similar components with opposite phases. 

3 Analysis 

We applied the global flow reconstruction on all four test 
data. Here we only provide the basic steps of the method 
and we refer to Buchler & Kollath (2001) for a detailed de- 
scription. The input data is converted to so called "delay 
vectors", which represent the flow in the phase space. We 
search for an operator (map) that connects the neighbouring 
delay vectors. Once the map has been found, we are able to 
iterate synthetic output data that will be compared to the in- 
put. If they are similar, the reconstruction can be considered 
successful. 

Comparison methods can not be analytical since the 
data is chaotic. Instead, we use different representations 
of the data which are compared visually. According the 
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Fig. 4 Left panels: Comparison of the Fourier-spectra of 
the extra variation and the variations that were removed in 
each test. Right panels: The low frequency region of the 
original Rossler and the different test data. Frequency is in 
arbitrary units. 

experiences from previous investigations (e.g. Buchler 
et al. 2004), the overall shape of the time series, the 
Fourier Transform, and the Broomhead-King (BK) projec- 
tion (Broomhead & King 1987) are useful visualisations, 
all of which must be similar to the input data. In the case 
of a successful reconstruction, the quantitative properties of 
synthetic data are good representations of the original data. 

We determine the Lyapunov exponents and dimension 
of the synthetic signal. Error estimation is performed with a 
Monte Carlo-like simulation: we expand the search of maps 
in the close neighbourhood of the phase space trajectory by 
adding a small amount of noise to the input data and then 
applying a cubic spline for smoothing. The delay parameter 
is also variable in a certain range depending on the sam- 
pling of the data. Expansion of the embedding dimension 
(up to 6D, the largest the dimension that the implementa- 
tion of the method can handle) gives extra solutions. The 
following parameter values were set in our tests: noise in- 
tensity: 0.001, 0.002, 0.003; smoothing parameter: 0.000, 
0.001, 0.002; and time delay: 5-25. The embedding dimen- 
sions were 5 and 6. The outcome of the reconstructions can 
be diverse: both chaotic and periodic, as well as fix point at- 
tractors can be found, but some iterations may also diverge 
(blow up). The ratio of successful chaotic maps is a good 
measure of the robustness of the reconstruction. 

4 Results 

The synthetic signals of TEST1 show some sign of ex- 
tra variation in 40% of the 5D and in 82% of the 6D re- 
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Fig. 5 An unsuccessful reconstruction of TEST1 data. 
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Fig. 6 An acceptable reconstruction of TEST1 data, 
showing mid-term variations that are faster than the long- 
term component in TEST1 (FigureHJ. 

constructions. Most of them do not resemble the original, 
they rather display irregular jumps between two oscillation 
states. A typical example is shown in Figure[5] In the cases 
where synthetic signals resemble somewhat the original and 
a long-term variation is also present, it is always shifted to 
higher frequencies (Figure |6}. Because of these effects, se- 
lection of "good" maps of TEST1 was more difficult than in 
the other cases. 

We summarized the results in Table 1. We listed the 
number of chaotic maps obtained from the reconstruction, 
the number of successful chaotic maps with percentage val- 
ues in brackets and the average Lyapunov dimension val- 
ues (Dl) of the successful maps. The calculated errors are 
standard deviations. All Lyapunov dimension values are in 
agreement with the results of the reconstruction of the origi- 
nal coupled Rossler data: 3.03 ± 0.30 in 5D and 3.11 ± 0.31 
in 6D values (Plachy & Kollath 2010). After the elimina- 
tion there was no drastic change in the numbers of chaotic 
maps, but the number of successful chaotic maps increased 
significantly. 

5 Conclusion 

We studied the effects of long-term variation in chaotic data 
sets with the global flow reconstruction method. The ef- 
fectiveness of the Empirical Mode Decomposition (EMD) 
method and the Fourier filtering to remove such variations 
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Table 1 Results of the reconstructions. 
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from the input data was investigated. The number of chaotic 
synthetic signals we obtained from the reconstructions were 
similar in all cases, between 140 and 189. However, the per- 
centages of successful chaotic maps show significant differ- 
ences, making the reconstruction much more robust (three 
times more successful) if the elimination is applied. The 
results suggest that removal of high-amplitude, long-term 
variations can help to detect chaos in light curves, espe- 
cially if only a few successful maps can be found otherwise. 
The EMD method was used for the first time for elimination 
of long-term variation in the context of dynamical analysis 
with global flow reconstruction. We found that the EMD 
method is as suitable for such application as the Fourier fil- 
tering. 
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